Web Sessions Clustering with Artificial Ants Colonies

نویسندگان

  • Nicolas Labroche
  • Nicolas Monmarché
  • Gilles Venturini
چکیده

In this paper, we present AntClust, an ant based clustering algorithm and its application to the Web usage mining problem. We define a Web session as a weighted multi-modal vector and we also develop a similarity measure between two sessions. We show that the partitions found by AntClust are stable on a data set made of real sessions extracted from a Web site of the University of Tours. Contrary to some other studies, we do not only consider the transactions model to describe the sessions. We show that our algorithm performs well and is able to find non-noisy clusters when dealing with sessions defined by a vector containing the number of hits recorded for each of the Web page.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

AntClust: Ant Clustering and Web Usage Mining

In this paper, we propose a new ant-based clustering algorithm called AntClust. It is inspired from the chemical recognition system of ants. In this system, the continuous interactions between the nestmates generate a “Gestalt” colonial odor. Similarly, our clustering algorithm associates an object of the data set to the odor of an ant and then simulates meetings between ants. At the end, artif...

متن کامل

AntTree: A Web Document Clustering Using Artificial Ants

We present in this work a new algorithm for document hierarchical clustering and automatic generation of portals sites. This model is inspired from the self-assembling behavior observed in real ants where ants progressively get attached to an existing support and successively to other attached ants. The artificial ants that we have defined will similarly build a tree. Each ant represents one do...

متن کامل

Learning Web Users Profiles With Relational Clustering Algorithms

In the context of web personalization and dynamic content recommendation, it is crucial to learn typical user profiles. Although there exists several approaches to mine user profiles (such as association rules or sequential patterns extraction), this paper focuses on the application of relational clustering algorithms on web usage data to characterize user access profiles. These methods rely on...

متن کامل

Predicting web user behavior using learning-based ant colony optimization

An ant colony optimization-based algorithm to predict web usage patterns is presented. Our methodology incorporates multiple data sources, such as web content and structure, as well as web usage. The model is based on a continuous learning strategy based on previous usage in which artificial ants try to fit their sessions with real usage through the modification of a text preference vector. Sub...

متن کامل

Application of Ant-based Template Matching for Web Documents Categorization

The self-organization behavior exhibited by ants may be modeled to solve real world clustering problems. The general idea of artificial ants walking around in search space to pick up, or drop an item based upon some probability measure has been examined to cluster a large number of World Wide Web (WWW) documents. However, this idea is extended with the direct application of template matching wi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003